Title: A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration
Conference: VLDB 2012
Author
Abstract
Truth discovery is an interesting problem in data integration. In practical data integration systems, it is common for the data sources being integrated to provide conflicting information about the same entity, which raises the truth finding problem. The authors propose a Bayesian approach, the latent truth model, to solve this problem. They also conduct experiments on both effectiveness and efficiency.
Similar resources
A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration
In practical data integration systems, it is common for the data sources being integrated to provide conflicting information about the same entity. Consequently, a major challenge for data integration is to derive the most complete and accurate integrated records from diverse and sometimes conflicting sources. We term this challenge the truth finding problem. We observe that some sources are ge...
A Probabilistic Model for Estimating Real-valued Truth from Conflicting Sources
One important task in data integration is to identify truth from noisy and conflicting data records collected from multiple sources, i.e., the truth finding problem. Previously, several methods have been proposed to solve this problem by simultaneously learning the quality of sources and the truth. However, all those methods are mainly designed for handling categorical data but not numerical da...
Integrating Conflicting Data: The Role of Source Dependence
Many data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical that data integration systems can resolve conf...
Information Integration: The MOMIS Project Demonstration
1 Overview. The goal of this demonstration is to present the main features of a Mediator component, Global Schema Builder, of an I3 system, called MOMIS (Mediator envirOnment for Multiple Information Sources) [1]. MOMIS [1, 2] has been conceived to provide an integrated access to heterogeneous information stored in traditional databases (e.g., relational, object-oriented) or file systems, as well as in...
Data Warehouse Configuration
In the data warehousing approach to the integration of data from multiple information sources, selected information is extracted in advance and stored in a repository. A data warehouse (DW) can therefore be seen as a set of materialized views defined over the sources. When a query is posed, it is evaluated locally, using the materialized views, without accessing the original information sources...
Journal title:
Volume, issue:
Pages: -
Publication date: 2013